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SECRETORY  POLYPEPTIDES  ENCODED  BY  BALBIANI  RING  GENES 
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Project  Summary 

The  aim  of  this  project  is  to  learn  about  the  structure, 
developmentally  regulated  synthesis  and  assembly  of  a  family  of  secretory 
proteins  (SPs)  into  an  insoluble  polymer  of  silk-like  threads.  SPs  are 
exclusively  synthesised  in  salivary  glands  of  aquatic  larvae  of  the  Dipteran, 
Chironomus.  AU  SPs  studied  to  date  are  composed  of  tandemly  repeated  amino 
acid  sequences.  Recombinant  cDNA  probes  are  used  to  map  SP-coding  genes  on 
polytene  chromosomes,  identify  their  mRNAs  on  Northern  blots  and  derive  the 
amino  acid  sequence  of  their  encoded  protein.  cDNA  probes  and  anti-SP 
antibodies  are  used  to  study  the  level  at  which  S?  gene  regulation  occurs 
during  larval  development  and  under  conditions  of  galactose-induced  alterations 
in  gene  expression.  SF  assembly  in  vitro  is  being  studied  by  a  combination  of 
physical,  electron  microscopic  and  biochemical  raethods.  We  hope  to  learn  which 
SPs  interact  with  each  other  and  v.'hat  is  the  chemical  nature  of  these 
interactions.  We  eventually  plan  to  determine  the  spatial  distribution  of  SPs 
within  assembled  complexes  by  making  three-dimensional  tomographic 
reconstructions  from  nnmunoelectron  micrographs.  This  experimental  system 
provides  a  unique  opportunity  to  study  how  naturally  occurring  soluble  proteins 
can  assemble  into  an  insoluble  fiber  that  functions  in  an  aqueous  environment. 

Results  from  the  Prior  Year 

cDNA  clones  for  additional  SPs.  Manuscripts  describing  the 
identification,  partial  structure  and  expression  of  genes  for  spl40  (Dignam  et 
al.  .  1989)  and  spl85  (Dignam  and  Case.  1990)  are  attached.  During  the  past 
year,  another  partial  cDNA  clone  (F4a)  has  been  sequenced.  While  the  1-kb 
sequence  lacks  landmarks  of  other  .SP-coding  cD'JAs  (tandemly  repeated 
sequences,  conserved  Cys  residues,  *  Pro  -  motifs!,  it  hybridizes  to  a  4.8-)ib 
poly:;A)+  RNA  that  in  salivary  glands  appears  to  be  as  .abundant  as  SP-coding 
mRNAs  .  To  identity  the  protein  encoded  by  this  mRNA  we  are  raising 
antibodies  against  a  15-residue  synthetic  peptide  whose  sequence  was  derived 
from  the  cDNA.  We  are  experimenting  v-nch  an  approach  developed  by  Temmi  and 
co-workers  whereby  peptide  conjugation  to  a  carrier  protein  is  rircumv-nted  by 
initiating  peptide  .synthesis  on  a  branohed-chain,  octavalent  c.ore  of  Lys 
residues . 

In  ^dtro  assembly/chsassembly  of  SPs.  A  manuscript  describing  the 
structure  and  m  vitro  disassembly /^reassembly  of  SPs  into  macromolecular 
complexes  has  beer,  published  (Wellmari  and  Case,  1989).  We  demonstrated  that 
purified  spis  (the  1000-i;Da  SPs)  could  assemble  into  complexes  'with  both 
similar  morphology,  using  electron  microscopy,  and  a  similar  dichro-ic  spectrum 
to  that  of  unfractionated  native  complexes.  We  estimated  from  rv.antitat've 
dhehroism  measurements  that  spIs  were  .approxiir.ately  151  c-helix,  28-301 
'-sheet.  26-2St  B-turn  and  25'4  other  secendary  structure.  During  the  past 
year  w-  syr.the.sized  and  purified  peptides  correspendino  to  the  alternating 
CONSTANT  :C)  and  SUEREFTAT  (SR)  domains  of  spDs .  C  arc!  SP  peptide.s 
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were  subjected  to  circular  dichroism  and  Infrared  spectrometry.  Data  for  the  C 
peptide  are  consistent  with  predictions  of  its  secondary  structure;  it  consists 
primarily  of  a-helix.  The  SR  peptide,  however,  does  not  consist  of  13-structure, 
Instead  it  appears  to  form  a  left-handed,  S-^/poly (Gly )II-type  helix.  Finally, 
the  Amide  A  frequencies  suggest  that  both  u  and  SR  peptides  may  form  sheets 
of  parallel  helices  rather  than  multistranded,  supercoiled  structures. 

Interspecific  comparisons  of  SPs.  V/e  mitlated  a  comparison  of  SP 
structure  among  related  but  different  specie.?  of  Chironomus .  Hybridisation  and 
antibody  probes  have  been  exchanged  with;  Professor  J.-E.  Edstrom's  lab 
(Lund,  Sweden)  which  studies  ^  paUidivittatus ;  Professor  1. 1.  Kiknadze’s  lab 
(Novosibirsk,  U.S.S.R.)  which  studies  ^  thummi .  Aside  from  identifying 
homolgous  S?  proteins  in  each  species,  we  hoped  that  species-specific 
differences  in  the  synthesis  of  SPs  might  provide  clues  to  the  role  of  each 
protein  in  silk  fiber  assembly.  While  working  in  Kiknadze's  lab  as  a 
U.S. -U.S.S.R.  National  Academies  of  Sciences  Exchange  Scientist,  I  made  an 
interesting  observation.  Silk  fibers  spun  by  larvae  from  each  species  are 
distinguishable  microscopically.  The  morphciogicai  differences  that  v/ere 
observed  correlate  with  the  presence  of  a  cell- specific  SP  that  is  not 
synthesized  by  ^  tentans  larvae. 

Plans  for  Next  Year 


S? -coding  genes.  We  will  continue  to  identify  and  map  additional 
SP-coding 'genes  using  recombinant  DNA  procedures  which  have  been  successful 
to  date.  Our  first  priority  will  be  to  determine  if  the  F4a  antipeptide  antibody 
reacts  with  an  SP  on  Western  blots.  If  it  doss,  we  will  continue  to  study  the 
structure  and  expression  of  this  mRNA.  Our  next  priority  will  be  to  construct  a 
cDNA  expression  libraiy  from  thummi  salivary  gland  RNA.  We  want  to  clone 
the  cDNA  for  the  cell-specific  SP  mentioned  above. 

Structural  organization  of  S?  complexes.  We  will  continue  to  study 
SP-3P  interactions  focusing  on  the  C  and  SR  domains  of  spis.  Biophysical 
studies  of  synthetic  C  and  SR  peptides  will  continue.  Their  assembly 
characteristics  will  be  compared  to  native  spIs  purified  by  glycerol  gradient 
cer.trifug.ation .  x^'e  hope  to  identify  one  or  more  sites  within  these  domains  thiat 
are  responsible  for  ^  vitro  assembly.  For  example,  preliminary  data  suggest 
that  C  peptides  can  dimerize  by  formation  of  a  disulfide  bond.  Peptide  mapping 
will  be  used  to  determine  whether  this  disulfide  bond  involves  a  specific  or 
random  pair  cf  Cys  residues.  Similar  studies  will  be  extended  to  spl  complexes 
formed  in  vitro  and  3P  com.plexes  formed  in  vivo . 
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Secretory  proteins  are  synthesized  in  salivary  glands 
of  the  insect,  Chironomus  tentans,  and  assemble  in  vivo 
into  silk-like  threads  which  aquatic  larvae  use  to  con¬ 
struct  tubes  for  filter  feeding  and  pupation.  Thus  far, 
all  known  secretory  protein  genes  contain  repetitious 
protein-coding  sequences  and  are  located  in  cytological 
structures  known  as  Balbiani  rings,  giant  puffs  found 
on  polytene  secretory  cell  chromosomes.  In  this  paper 
we  describe  the  identiHcation  of  another  secretory  pro¬ 
tein  gene  which  is  comprised  of  repeated  sequences; 
however,  this  gene  is  not  located  in  a  Balbiani  ring. 
Two  partial  cDNA  clones  from  a  3.6-kilobase  pair 
poly(A)'^  RNA  were  sequenced  and  found  to  contain 
two  open  reading  frames  for  protein  synthesis.  Anti¬ 
bodies  were  raised  against  synthetic  oligopeptides 
whose  sequences  were  derived  from  these  two  open 
reading  frames.  An  immunoafAnity-purified  antibody 
for  one  of  these  peptides  bound  specifically  to  a  140- 
kDa  secretory  protein  (spl40).  The  cDNA  sequences 
contain  tandem  repeats  of  42  base  pairs  which  encode 
a  repeat  of  14  amino  acids  with  a  composition  and 
oligopeptide  sequence  similar  to  other  secretory  pro¬ 
teins.  The  C.  tentans  genome  contains  about  70  copies 
of  this  42-base  pair  repeat  organized  as  a  contiguous 
block  of  3  kilobase  pairs  or  less.  The  spl40  gene  was 
mapped  by  in  situ  hybridization  to  polytene  chromo¬ 
some  band  I-17-B.  Developmental  studies  of  protein 
accumulation,  steady-state  levels  of  mRNA,  and  rela¬ 
tive  transcription  rate  suggested  that  the  spl40  gene 
is  developmentally  regulated  so  that  maximal  expres¬ 
sion  is  achieved  during  the  prepupal  stages  of  the 
fourth  larval  instar.  Based  upon  these  results  we  pro¬ 
posed  that  spl40  gene  belongs  to  a  prepupal  class  of 
secretory  protein  genes.  While  the  spl40  gene  shares 
structural  and  expression  characteristics  with  other 
secretory  protein  genes,  its  unique  chromosomal  loca¬ 
tion  shows  that  this  multigene  family  is  not  restricted 
to  Balbiani  rings. 


*  This  research  was  supported  by  Office  of  Naval  Research  Con¬ 
tract  N00014-87-K-0387  (to  S.  T.  C.)  and  a  grant  from  the  Swiss 
National  Science  Foundation  (to  M.  L.).  The  costs  of  publication  of 
this  article  were  defrayed  in  part  by  the  payment  of  page  charges. 
This  article  must  therefore  be  hereby  marked  "advertisement"  in 
accordance  with  18  U.S.C.  Section  1734  solely  to  indicate  this  fact. 
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to  the  GenBank^'^/EMBL  Data  Bank  with  accession  numbetis) 
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The  organization  and  chromosomal  distribution  of  eukar¬ 
yotic  multigene  families  is  paradoxical.  Some  gene  families 
(1-9)  are  dispersed  throughout  the  genome,  while  others  (10- 
16)  are  organized  as  gene  clusters  on  one  or  more  chromo¬ 
somes.  In  some  instances,  the  organization  and  distribution 
of  gene  clusters  may  be  important  for  coordinate  regulation 
of  the  expression  of  its  constituent  genes.  For  example,  the 
spatial  distribution  of  genes  in  the  globin  cluster  reflects  their 
temporal  pattern  of  expression  during  development  (12).  Cer¬ 
tain  histone  gene  clusters  represent  different  expression 
classes  of  genes  (10).  In  other  instances,  there  is  no  apparent 
pattern  of  expression  for  genes  witnin  a  cluster. 

The  chromosomal  distribution  of  the  secretory  protein  mul¬ 
tigene  family  in  Chironomus  contributes  to  this  paradox.  For 
example,  the  spl‘  class  of  genes  (17-19  and  references  therein) 
consists  of  four  members  which  encode  structurally  similar 
secretory'  proteins  (spla.  splb,  spic,  and  spid).  Each  protein 
has  a  molecular  mass  of  about  1000  kDa  and  is  largely- 
composed  of  complex  core  repeats  which  can  be  divided  into 
two  regions.  The  constant  (C)  region  is  35-45  amino  acids  in 
length  and  contains  4  cysteine,  1  methionine,  and  1  phenyl¬ 
alanine  residues  which  are  invariant.  The  subrepeat  (SR) 
region  contains  four-six  direct  repeats  of  6-12  amino  acid 
sequences  which  contain  a  characteristic  tripeptide  motif:  a 
positively  charged  residue  (lysine  or  arginine)  followed  by 
proline  followed  by  a  negatively  charged  residue  (glutamic 
acid,  aspartic  acid,  or  phosphoseiine),  henceforth  designated 
as  ©  Pro  ©.  These  alternating  C  and  SR  regions  are  thought 
to  be  important  sites  for  protein-protein  interactions  which 
take  place  when  secretory  proteins  assemble  into  silk-like 
threads  (18,  19).  Larvae  spin  these  threads  to  construct  un¬ 
derwater  tubes  for  filter-feeding  and  pupation  (20,  21).  All  spl 
genes  are  located  in  cytological  structures  known  as  Balbiani 
rings  (BRs),  and  their  distribution  on  polytene  chromosomes 
reflects  their  differential  expression  in  response  to  galactose 
(22-26):  BRl  on  chromosome  IV  contains  the  spla  gene  whose 
expression  is  generally  unaffected  by  galactose:  BR2  on  chro¬ 
mosome  IV  contains  the  splb  and  spId  genes  both  of  which 
are  repressed  by  galactose;  BR6  on  chromosome  III  contains 
spIc.  the  only  gene  which  is  induced  by  galactose. 

BRl.  like  BR2.  also  contains  a  cluster  of  secretory  protein 
genes.  In  addition  to  the  spla  gene,  it  contains  the  gene  for 


'  The  abbreviations  used  are:  spl.  a  1000- kDa  secretory  protein; 
sp“x''.  other  secretory  proteins  with  an  apparent  M,  equal  to  "x";  BR. 
Balbiani  ring;  sp.  secretoiy  protein;  C  region,  constant  region  of  an 
spl  core  repeat:  SR  region,  subrepeat  region  of  an  spl  core  repeat: 
ORF.  open  reading  frame;  bp,  base  pair:  kb,  kiioha.se  pairs. 
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